Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 2804 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 241.1 KiB |
| Average record size in memory | 88.0 B |
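The two memory figures above can be reproduced with simple arithmetic. The sketch below is only a consistency check: the 8 bytes per cell and the 128-byte index overhead are assumptions (typical of 64-bit columns and a default range index), not values stated in the report.

```python
# Assumed, not stated in the report: each of the 11 columns stores one 8-byte
# value (or pointer) per row, and the row index adds a fixed 128 bytes.
N_ROWS = 2804
N_COLS = 11
BYTES_PER_CELL = 8
INDEX_OVERHEAD = 128

total_bytes = N_ROWS * N_COLS * BYTES_PER_CELL + INDEX_OVERHEAD
total_kib = total_bytes / 1024
avg_record_bytes = total_bytes / N_ROWS

print(f"Total size in memory: {total_kib:.1f} KiB")
print(f"Average record size:  {avg_record_bytes:.1f} B")
```

Under these assumptions the totals round to the 241.1 KiB and 88.0 B shown in the table.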
Variable types
| Categorical | 2 |
|---|---|
| Numeric | 9 |
Warnings
df_index has a high cardinality: 2804 distinct values | High cardinality |
open is highly correlated with high and 3 other fields | High correlation |
high is highly correlated with open and 3 other fields | High correlation |
low is highly correlated with open and 3 other fields | High correlation |
adj_close is highly correlated with open and 3 other fields | High correlation |
volume is highly correlated with open and 3 other fields | High correlation |
RSI_14 is highly correlated with STO_14 | High correlation |
STO_14 is highly correlated with RSI_14 | High correlation |
open is highly correlated with high and 2 other fields | High correlation |
high is highly correlated with open and 2 other fields | High correlation |
low is highly correlated with open and 2 other fields | High correlation |
adj_close is highly correlated with open and 2 other fields | High correlation |
RSI_14 is highly correlated with STO_14 | High correlation |
STO_14 is highly correlated with RSI_14 | High correlation |
RSI_14 is highly correlated with CHO and 2 other fields | High correlation |
CHO is highly correlated with RSI_14 and 5 other fields | High correlation |
volume is highly correlated with CHO and 5 other fields | High correlation |
STO_14 is highly correlated with RSI_14 | High correlation |
high is highly correlated with CHO and 4 other fields | High correlation |
open is highly correlated with CHO and 4 other fields | High correlation |
adj_close is highly correlated with CHO and 4 other fields | High correlation |
return is highly correlated with RSI_14 and 1 other fields | High correlation |
low is highly correlated with CHO and 4 other fields | High correlation |
df_index is uniformly distributed | Uniform |
df_index has unique values | Unique |
CHO has unique values | Unique |
STO_14 has 362 (12.9%) zeros | Zeros |
Reproduction
| Analysis started | 2021-06-16 04:20:58.474991 |
|---|---|
| Analysis finished | 2021-06-16 04:21:32.707220 |
| Duration | 34.23 seconds |
| Software version | pandas-profiling v3.0.0 |
df_index
Categorical
| Distinct | 2804 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.0 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 28040 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
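As a rough illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module on the first sample value of this variable. It covers only the general category ("Nd" is Decimal Number, "Pd" is Dash Punctuation); scripts and blocks are not exposed by that module and are left out.

```python
import unicodedata
from collections import Counter

sample = "2010-01-22"  # 1st row of this variable's Sample table

# General category of each code point in the string.
categories = Counter(unicodedata.category(ch) for ch in sample)

# Frequency of each distinct character.
char_counts = Counter(sample)

print(categories)
print(char_counts)
```

Every value in this column is a 10-character date, so each row contributes eight decimal digits and two dashes.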
Unique
| Unique | 2804 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2010-01-22 |
|---|---|
| 2nd row | 2010-01-26 |
| 3rd row | 2010-01-27 |
| 4th row | 2010-01-28 |
| 5th row | 2010-01-29 |
Common Values
| Value | Count | Frequency (%) |
| 2017-10-25 | 1 | < 0.1% |
| 2020-10-23 | 1 | < 0.1% |
| 2014-11-12 | 1 | < 0.1% |
| 2014-03-05 | 1 | < 0.1% |
| 2010-07-22 | 1 | < 0.1% |
| 2018-06-28 | 1 | < 0.1% |
| 2017-01-20 | 1 | < 0.1% |
| 2019-03-14 | 1 | < 0.1% |
| 2012-04-20 | 1 | < 0.1% |
| 2014-03-17 | 1 | < 0.1% |
| Other values (2794) | 2794 | 99.6% |
Length
Histogram of lengths of the category
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6750 | 24.1% |
| - | 5608 | 20.0% |
| 1 | 5161 | 18.4% |
| 2 | 5025 | 17.9% |
| 3 | 922 | 3.3% |
| 8 | 777 | 2.8% |
| 4 | 766 | 2.7% |
| 6 | 762 | 2.7% |
| 5 | 762 | 2.7% |
| 7 | 760 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 22432 | 80.0% |
| Dash Punctuation | 5608 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6750 | 30.1% |
| 1 | 5161 | 23.0% |
| 2 | 5025 | 22.4% |
| 3 | 922 | 4.1% |
| 8 | 777 | 3.5% |
| 4 | 766 | 3.4% |
| 6 | 762 | 3.4% |
| 5 | 762 | 3.4% |
| 7 | 760 | 3.4% |
| 9 | 747 | 3.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5608 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 28040 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6750 | 24.1% |
| - | 5608 | 20.0% |
| 1 | 5161 | 18.4% |
| 2 | 5025 | 17.9% |
| 3 | 922 | 3.3% |
| 8 | 777 | 2.8% |
| 4 | 766 | 2.7% |
| 6 | 762 | 2.7% |
| 5 | 762 | 2.7% |
| 7 | 760 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28040 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6750 | 24.1% |
| - | 5608 | 20.0% |
| 1 | 5161 | 18.4% |
| 2 | 5025 | 17.9% |
| 3 | 922 | 3.3% |
| 8 | 777 | 2.8% |
| 4 | 766 | 2.7% |
| 6 | 762 | 2.7% |
| 5 | 762 | 2.7% |
| 7 | 760 | 2.7% |
open
Numeric
| Distinct | 1988 |
|---|---|
| Distinct (%) | 70.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 67.79322753 |
| Minimum | 36.32 |
| Maximum | 119.89 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.0 KiB |
Quantile statistics
| Minimum | 36.32 |
|---|---|
| 5-th percentile | 45.953 |
| Q1 | 53.1 |
| median | 62.055 |
| Q3 | 77.64 |
| 95-th percentile | 109.41 |
| Maximum | 119.89 |
| Range | 83.57 |
| Interquartile range (IQR) | 24.54 |
Descriptive statistics
| Standard deviation | 19.36664502 |
|---|---|
| Coefficient of variation (CV) | 0.2856722673 |
| Kurtosis | -0.1051485656 |
| Mean | 67.79322753 |
| Median Absolute Deviation (MAD) | 10.2 |
| Skewness | 0.9348437557 |
| Sum | 190092.21 |
| Variance | 375.0669393 |
| Monotonicity | Not monotonic |
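Several of the figures above are derived from one another, which gives a quick way to sanity-check the tables. The sketch below recomputes the range, interquartile range, coefficient of variation and variance from the other reported values; the tolerances are arbitrary choices that allow for the rounding in the report.

```python
import math

# Values copied from the quantile and descriptive statistics tables above.
minimum, maximum = 36.32, 119.89
q1, q3 = 53.1, 77.64
mean = 67.79322753
std = 19.36664502

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)
variance = std ** 2              # Variance

# Compare against the figures printed in the report.
checks = {
    "range": math.isclose(value_range, 83.57, abs_tol=1e-9),
    "iqr": math.isclose(iqr, 24.54, abs_tol=1e-9),
    "cv": math.isclose(cv, 0.2856722673, rel_tol=1e-6),
    "variance": math.isclose(variance, 375.0669393, rel_tol=1e-6),
}
print(checks)
```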
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 56 | 13 | 0.5% |
| 55 | 9 | 0.3% |
| 53 | 9 | 0.3% |
| 50 | 8 | 0.3% |
| 58 | 8 | 0.3% |
| 69 | 8 | 0.3% |
| 57.5 | 7 | 0.2% |
| 52.5 | 7 | 0.2% |
| 68 | 7 | 0.2% |
| 70.4 | 7 | 0.2% |
| Other values (1978) | 2721 | 97.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 36.32 | 1 | < 0.1% |
| 36.49 | 1 | < 0.1% |
| 36.6 | 1 | < 0.1% |
| 36.75 | 1 | < 0.1% |
| 37.19 | 1 | < 0.1% |
| 37.48 | 1 | < 0.1% |
| 37.51 | 3 | 0.1% |
| 37.73 | 1 | < 0.1% |
| 37.85 | 1 | < 0.1% |
| 38.01 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 119.89 | 1 | < 0.1% |
| 119.75 | 1 | < 0.1% |
| 119.46 | 1 | < 0.1% |
| 119.44 | 1 | < 0.1% |
| 119.05 | 1 | < 0.1% |
| 118.94 | 1 | < 0.1% |
| 118.79 | 1 | < 0.1% |
| 118.49 | 1 | < 0.1% |
| 118.27 | 1 | < 0.1% |
| 118.26 | 1 | < 0.1% |
high
Numeric
| Distinct | 2044 |
|---|---|
| Distinct (%) | 72.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68.41776034 |
| Minimum | 36.61 |
| Maximum | 120.89 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.0 KiB |
Quantile statistics
| Minimum | 36.61 |
|---|---|
| 5-th percentile | 46.5 |
| Q1 | 53.7275 |
| median | 62.775 |
| Q3 | 78.67 |
| 95-th percentile | 110.601 |
| Maximum | 120.89 |
| Range | 84.28 |
| Interquartile range (IQR) | 24.9425 |
Descriptive statistics
| Standard deviation | 19.4604303 |
|---|---|
| Coefficient of variation (CV) | 0.2844353601 |
| Kurtosis | -0.1017436866 |
| Mean | 68.41776034 |
| Median Absolute Deviation (MAD) | 10.355 |
| Skewness | 0.9387675829 |
| Sum | 191843.4 |
| Variance | 378.7083475 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 56 | 13 | 0.5% |
| 65 | 9 | 0.3% |
| 60 | 9 | 0.3% |
| 63 | 9 | 0.3% |
| 55 | 8 | 0.3% |
| 59 | 7 | 0.2% |
| 63.99 | 7 | 0.2% |
| 52.99 | 7 | 0.2% |
| 49.35 | 7 | 0.2% |
| 50.1 | 6 | 0.2% |
| Other values (2034) | 2722 | 97.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 36.61 | 1 | < 0.1% |
| 36.75 | 1 | < 0.1% |
| 36.96 | 1 | < 0.1% |
| 37.3 | 1 | < 0.1% |
| 37.51 | 1 | < 0.1% |
| 37.63 | 1 | < 0.1% |
| 37.68 | 1 | < 0.1% |
| 37.85 | 1 | < 0.1% |
| 38.01 | 1 | < 0.1% |
| 38.33 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 120.89 | 1 | < 0.1% |
| 120.66 | 1 | < 0.1% |
| 120.15 | 1 | < 0.1% |
| 119.97 | 1 | < 0.1% |
| 119.93 | 1 | < 0.1% |
| 119.83 | 1 | < 0.1% |
| 119.58 | 1 | < 0.1% |
| 119.5 | 1 | < 0.1% |
| 119.44 | 1 | < 0.1% |
| 119.29 | 1 | < 0.1% |
low
Numeric
| Distinct | 2097 |
|---|---|
| Distinct (%) | 74.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 67.09756063 |
| Minimum | 36.01 |
| Maximum | 119.53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.0 KiB |
Quantile statistics
| Minimum | 36.01 |
|---|---|
| 5-th percentile | 45.123 |
| Q1 | 52.4975 |
| median | 61.56 |
| Q3 | 76.97 |
| 95-th percentile | 108.639 |
| Maximum | 119.53 |
| Range | 83.52 |
| Interquartile range (IQR) | 24.4725 |
Descriptive statistics
| Standard deviation | 19.25837955 |
|---|---|
| Coefficient of variation (CV) | 0.2870205619 |
| Kurtosis | -0.09038330103 |
| Mean | 67.09756063 |
| Median Absolute Deviation (MAD) | 10.31 |
| Skewness | 0.936066514 |
| Sum | 188141.56 |
| Variance | 370.8851831 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 55 | 9 | 0.3% |
| 56.5 | 7 | 0.2% |
| 51.7 | 6 | 0.2% |
| 55.5 | 6 | 0.2% |
| 56 | 6 | 0.2% |
| 54 | 6 | 0.2% |
| 59 | 6 | 0.2% |
| 57.5 | 6 | 0.2% |
| 69.5 | 5 | 0.2% |
| 54.5 | 5 | 0.2% |
| Other values (2087) | 2742 | 97.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 36.01 | 1 | < 0.1% |
| 36.05 | 1 | < 0.1% |
| 36.2 | 1 | < 0.1% |
| 36.24 | 1 | < 0.1% |
| 36.63 | 1 | < 0.1% |
| 36.69 | 1 | < 0.1% |
| 36.75 | 1 | < 0.1% |
| 36.82 | 1 | < 0.1% |
| 36.83 | 1 | < 0.1% |
| 37.36 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 119.53 | 1 | < 0.1% |
| 118.79 | 2 | 0.1% |
| 118.57 | 1 | < 0.1% |
| 118.05 | 1 | < 0.1% |
| 117.96 | 1 | < 0.1% |
| 117.84 | 1 | < 0.1% |
| 117.81 | 1 | < 0.1% |
| 117.74 | 1 | < 0.1% |
| 117.72 | 1 | < 0.1% |
| 117.48 | 1 | < 0.1% |
adj_close
Numeric
| Distinct | 2028 |
|---|---|
| Distinct (%) | 72.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 67.68993937 |
| Minimum | 36.45 |
| Maximum | 120.72 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.0 KiB |
Quantile statistics
| Minimum | 36.45 |
|---|---|
| 5-th percentile | 45.853 |
| Q1 | 52.9775 |
| median | 62 |
| Q3 | 77.725 |
| 95-th percentile | 109.4485 |
| Maximum | 120.72 |
| Range | 84.27 |
| Interquartile range (IQR) | 24.7475 |
Descriptive statistics
| Standard deviation | 19.40675783 |
|---|---|
| Coefficient of variation (CV) | 0.2867007714 |
| Kurtosis | -0.09932680875 |
| Mean | 67.68993937 |
| Median Absolute Deviation (MAD) | 10.3 |
| Skewness | 0.9396392631 |
| Sum | 189802.59 |
| Variance | 376.6222496 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 55 | 10 | 0.4% |
| 54 | 8 | 0.3% |
| 56 | 8 | 0.3% |
| 64.01 | 7 | 0.2% |
| 70 | 6 | 0.2% |
| 53.5 | 6 | 0.2% |
| 51.8 | 6 | 0.2% |
| 70.25 | 5 | 0.2% |
| 55.05 | 5 | 0.2% |
| 51.52 | 5 | 0.2% |
| Other values (2018) | 2738 | 97.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 36.45 | 1 | < 0.1% |
| 36.5 | 1 | < 0.1% |
| 36.6 | 1 | < 0.1% |
| 36.75 | 1 | < 0.1% |
| 36.85 | 1 | < 0.1% |
| 36.86 | 1 | < 0.1% |
| 37.4 | 1 | < 0.1% |
| 37.48 | 2 | 0.1% |
| 37.52 | 1 | < 0.1% |
| 37.75 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 120.72 | 1 | < 0.1% |
| 120.4 | 1 | < 0.1% |
| 119.59 | 1 | < 0.1% |
| 119.45 | 1 | < 0.1% |
| 119.26 | 1 | < 0.1% |
| 119.24 | 1 | < 0.1% |
| 118.68 | 1 | < 0.1% |
| 118.62 | 1 | < 0.1% |
| 118.4 | 1 | < 0.1% |
| 118.37 | 1 | < 0.1% |
volume
Numeric
| Distinct | 2260 |
|---|---|
| Distinct (%) | 80.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2898018.166 |
| Minimum | 0 |
| Maximum | 45899510 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 216838 |
| median | 1830010 |
| Q3 | 3961882.5 |
| 95-th percentile | 10100073.3 |
| Maximum | 45899510 |
| Range | 45899510 |
| Interquartile range (IQR) | 3745044.5 |
Descriptive statistics
| Standard deviation | 3865365.685 |
|---|---|
| Coefficient of variation (CV) | 1.333796223 |
| Kurtosis | 17.16526454 |
| Mean | 2898018.166 |
| Median Absolute Deviation (MAD) | 1829894.5 |
| Skewness | 3.152217722 |
| Sum | 8126042938 |
| Variance | 1.494105188 × 10¹³ |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 34 | 22 | 0.8% |
| 26 | 19 | 0.7% |
| 24 | 17 | 0.6% |
| 25 | 17 | 0.6% |
| 12 | 17 | 0.6% |
| 28 | 17 | 0.6% |
| 15 | 16 | 0.6% |
| 22 | 16 | 0.6% |
| 21 | 15 | 0.5% |
| 29 | 15 | 0.5% |
| Other values (2250) | 2633 | 93.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 2 | 0.1% |
| 1 | 4 | 0.1% |
| 2 | 2 | 0.1% |
| 3 | 4 | 0.1% |
| 5 | 2 | 0.1% |
| 6 | 7 | 0.2% |
| 7 | 8 | 0.3% |
| 8 | 5 | 0.2% |
| 9 | 5 | 0.2% |
| 10 | 7 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 45899510 | 1 | < 0.1% |
| 37160660 | 1 | < 0.1% |
| 34289940 | 1 | < 0.1% |
| 31237700 | 1 | < 0.1% |
| 30097830 | 1 | < 0.1% |
| 28912530 | 1 | < 0.1% |
| 28728360 | 1 | < 0.1% |
| 28398250 | 1 | < 0.1% |
| 27526870 | 1 | < 0.1% |
| 26825567 | 1 | < 0.1% |
RSI_14
Numeric
| Distinct | 2787 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51.79138584 |
| Minimum | 14.38390501 |
| Maximum | 82.2701558 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.0 KiB |
Quantile statistics
| Minimum | 14.38390501 |
|---|---|
| 5-th percentile | 34.07193908 |
| Q1 | 44.37680073 |
| median | 51.40153149 |
| Q3 | 59.39856181 |
| 95-th percentile | 69.91217465 |
| Maximum | 82.2701558 |
| Range | 67.8862508 |
| Interquartile range (IQR) | 15.02176107 |
Descriptive statistics
| Standard deviation | 10.91424478 |
|---|---|
| Coefficient of variation (CV) | 0.2107347507 |
| Kurtosis | -0.13726454 |
| Mean | 51.79138584 |
| Median Absolute Deviation (MAD) | 7.42926744 |
| Skewness | -0.003104048169 |
| Sum | 145223.0459 |
| Variance | 119.1207391 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 57.31903207 | 2 | 0.1% |
| 42.91456736 | 2 | 0.1% |
| 58.39071959 | 2 | 0.1% |
| 39.88547646 | 2 | 0.1% |
| 60.00618112 | 2 | 0.1% |
| 73.86329655 | 2 | 0.1% |
| 47.58823253 | 2 | 0.1% |
| 37.83420067 | 2 | 0.1% |
| 32.99216982 | 2 | 0.1% |
| 46.75749372 | 2 | 0.1% |
| Other values (2777) | 2784 | 99.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 14.38390501 | 1 | < 0.1% |
| 19.19322028 | 1 | < 0.1% |
| 19.23250512 | 1 | < 0.1% |
| 20.24601379 | 1 | < 0.1% |
| 20.43519901 | 1 | < 0.1% |
| 20.88636622 | 1 | < 0.1% |
| 21.04064535 | 1 | < 0.1% |
| 21.04553096 | 1 | < 0.1% |
| 21.21497632 | 1 | < 0.1% |
| 22.13958574 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 82.2701558 | 1 | < 0.1% |
| 82.11275583 | 1 | < 0.1% |
| 81.66817866 | 1 | < 0.1% |
| 81.63783152 | 1 | < 0.1% |
| 81.43844668 | 1 | < 0.1% |
| 81.30189157 | 1 | < 0.1% |
| 80.94410619 | 1 | < 0.1% |
| 80.46090085 | 1 | < 0.1% |
| 79.61027223 | 1 | < 0.1% |
| 79.43850683 | 1 | < 0.1% |
STO_14
Numeric
| Distinct | 1941 |
|---|---|
| Distinct (%) | 69.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.86221418 |
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 362 |
| Zeros (%) | 12.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 20.16578184 |
| median | 59.22604219 |
| Q3 | 91.46362728 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 71.29784545 |
Descriptive statistics
| Standard deviation | 36.46953461 |
|---|---|
| Coefficient of variation (CV) | 0.6647477713 |
| Kurtosis | -1.415159177 |
| Mean | 54.86221418 |
| Median Absolute Deviation (MAD) | 34.79673529 |
| Skewness | -0.2058783452 |
| Sum | 153833.6486 |
| Variance | 1330.026954 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 100 | 476 | 17.0% |
| 0 | 362 | 12.9% |
| 4.782608696 | 3 | 0.1% |
| 46.66666667 | 2 | 0.1% |
| 44.48818898 | 2 | 0.1% |
| 50.66964286 | 2 | 0.1% |
| 9.124087591 | 2 | 0.1% |
| 70 | 2 | 0.1% |
| 16.75977654 | 2 | 0.1% |
| 43.17269076 | 2 | 0.1% |
| Other values (1931) | 1949 | 69.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 362 | 12.9% |
| 0.1886792453 | 1 | < 0.1% |
| 0.2066115702 | 1 | < 0.1% |
| 0.2222222222 | 1 | < 0.1% |
| 0.2564102564 | 1 | < 0.1% |
| 0.3205128205 | 1 | < 0.1% |
| 0.4415011038 | 1 | < 0.1% |
| 0.4514672686 | 1 | < 0.1% |
| 0.5509641873 | 1 | < 0.1% |
| 0.5540166205 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 100 | 476 | 17.0% |
| 99.78902954 | 1 | < 0.1% |
| 99.75124378 | 1 | < 0.1% |
| 99.7382199 | 1 | < 0.1% |
| 99.69512195 | 1 | < 0.1% |
| 99.68553459 | 1 | < 0.1% |
| 99.5862069 | 1 | < 0.1% |
| 99.51690821 | 1 | < 0.1% |
| 99.49874687 | 1 | < 0.1% |
| 99.46091644 | 1 | < 0.1% |
CHO
Numeric
| Distinct | 2804 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49673.18421 |
| Minimum | -11779644.18 |
| Maximum | 11872803.46 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1524 |
| Negative (%) | 54.4% |
| Memory size | 22.0 KiB |
Quantile statistics
| Minimum | -11779644.18 |
|---|---|
| 5-th percentile | -3704335.069 |
| Q1 | -544780.5208 |
| median | -40.61082477 |
| Q3 | 668540.8088 |
| 95-th percentile | 3722691.593 |
| Maximum | 11872803.46 |
| Range | 23652447.64 |
| Interquartile range (IQR) | 1213321.33 |
Descriptive statistics
| Standard deviation | 2280116.031 |
|---|---|
| Coefficient of variation (CV) | 45.9023529 |
| Kurtosis | 5.006596298 |
| Mean | 49673.18421 |
| Median Absolute Deviation (MAD) | 609239.8522 |
| Skewness | -0.1008284634 |
| Sum | 139283608.5 |
| Variance | 5.198929115 × 10¹² |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 747584.0568 | 1 | < 0.1% |
| -28.29142569 | 1 | < 0.1% |
| -63104.34273 | 1 | < 0.1% |
| 30.11442388 | 1 | < 0.1% |
| -53.33327633 | 1 | < 0.1% |
| 1465195.806 | 1 | < 0.1% |
| -1424209.889 | 1 | < 0.1% |
| -11.83789577 | 1 | < 0.1% |
| -3357827.818 | 1 | < 0.1% |
| 2325051.64 | 1 | < 0.1% |
| Other values (2794) | 2794 | 99.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
| -11779644.18 | 1 | < 0.1% |
| -11755122.58 | 1 | < 0.1% |
| -10641752.02 | 1 | < 0.1% |
| -10635639.38 | 1 | < 0.1% |
| -10441979.78 | 1 | < 0.1% |
| -9892370.229 | 1 | < 0.1% |
| -9850856.831 | 1 | < 0.1% |
| -9666719.398 | 1 | < 0.1% |
| -9389014.873 | 1 | < 0.1% |
| -9318371.057 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 11872803.46 | 1 | < 0.1% |
| 11441018.89 | 1 | < 0.1% |
| 10906629.97 | 1 | < 0.1% |
| 10892257.92 | 1 | < 0.1% |
| 10743244.47 | 1 | < 0.1% |
| 10475236.13 | 1 | < 0.1% |
| 9734150.04 | 1 | < 0.1% |
| 9710380.095 | 1 | < 0.1% |
| 9457318.155 | 1 | < 0.1% |
| 9378587.134 | 1 | < 0.1% |
return
Numeric
| Distinct | 2767 |
|---|---|
| Distinct (%) | 98.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0004091337009 |
| Minimum | -0.1457470144 |
| Maximum | 0.1339514979 |
| Zeros | 25 |
| Zeros (%) | 0.9% |
| Negative | 1351 |
| Negative (%) | 48.2% |
| Memory size | 22.0 KiB |
Quantile statistics
| Minimum | -0.1457470144 |
|---|---|
| 5-th percentile | -0.02762426288 |
| Q1 | -0.009295309773 |
| median | 0.000326768495 |
| Q3 | 0.01029545987 |
| 95-th percentile | 0.02950649635 |
| Maximum | 0.1339514979 |
| Range | 0.2796985122 |
| Interquartile range (IQR) | 0.01959076964 |
Descriptive statistics
| Standard deviation | 0.01962155952 |
|---|---|
| Coefficient of variation (CV) | 47.95879555 |
| Kurtosis | 6.860064297 |
| Mean | 0.0004091337009 |
| Median Absolute Deviation (MAD) | 0.009767486183 |
| Skewness | -0.2057492295 |
| Sum | 1.147210897 |
| Variance | 0.0003850055979 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 0 | 25 | 0.9% |
| -0.01851851852 | 3 | 0.1% |
| -0.01818181818 | 2 | 0.1% |
| 0.002 | 2 | 0.1% |
| -0.00018178513 | 2 | 0.1% |
| 0.02459016393 | 2 | 0.1% |
| 0.004618937644 | 2 | 0.1% |
| 0.003115264798 | 2 | 0.1% |
| -0.00495049505 | 2 | 0.1% |
| 0.01553166069 | 2 | 0.1% |
| Other values (2757) | 2760 | 98.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
| -0.1457470144 | 1 | < 0.1% |
| -0.1395144043 | 1 | < 0.1% |
| -0.123940678 | 1 | < 0.1% |
| -0.1171471927 | 1 | < 0.1% |
| -0.09703258693 | 1 | < 0.1% |
| -0.09477848101 | 1 | < 0.1% |
| -0.08927272727 | 1 | < 0.1% |
| -0.08663707332 | 1 | < 0.1% |
| -0.08642192166 | 1 | < 0.1% |
| -0.08628276258 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 0.1339514979 | 1 | < 0.1% |
| 0.1155036095 | 1 | < 0.1% |
| 0.1121408712 | 1 | < 0.1% |
| 0.1023879621 | 1 | < 0.1% |
| 0.1012060829 | 1 | < 0.1% |
| 0.09310527244 | 1 | < 0.1% |
| 0.08695652174 | 1 | < 0.1% |
| 0.08225806452 | 1 | < 0.1% |
| 0.07170495768 | 1 | < 0.1% |
| 0.07071639586 | 1 | < 0.1% |
Target
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.0 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 8412 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 1429 | 51.0% |
| 0.0 | 1375 | 49.0% |
Length
Histogram of lengths of the category
Pie chart
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4179 | 49.7% |
| . | 2804 | 33.3% |
| 1 | 1429 | 17.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5608 | 66.7% |
| Other Punctuation | 2804 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4179 | 74.5% |
| 1 | 1429 | 25.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2804 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8412 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4179 | 49.7% |
| . | 2804 | 33.3% |
| 1 | 1429 | 17.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8412 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4179 | 49.7% |
| . | 2804 | 33.3% |
| 1 | 1429 | 17.0% |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear relationship the steepness of the line does not affect the magnitude of r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
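The definition of r can be written out directly. The sketch below is a minimal pure-Python version, not the profiling tool's own implementation; the sample values are the first five `open` and `high` entries from the First rows table, and the function name `pearson_r` is chosen here for illustration.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n factors of the covariance and the two variances cancel out.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


opens = [65.60, 65.05, 64.99, 65.06, 65.50]
highs = [65.96, 65.36, 65.00, 66.49, 65.99]

r = pearson_r(opens, highs)
# Shifting and positively rescaling one variable leaves r unchanged.
r_rescaled = pearson_r(opens, [2 * h + 3 for h in highs])
print(r, r_rescaled)
```

The function does not guard against a constant input, for which the denominator is zero.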
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
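A short sketch of that calculation: rank both variables, then apply the Pearson formula to the ranks. The helper names and the toy data are illustrative only, and ties are given the mean of their ranks, which is a common convention but an assumption here.

```python
import math


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1  # positions are 0-based, ranks 1-based
        for pos in range(start, end + 1):
            ranks[order[pos]] = mean_rank
        start = end + 1
    return ranks


def pearson_r(xs, ys):
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# A monotonic but nonlinear relationship: y = x cubed.
xs = [-2.0, -1.0, 0.0, 1.0, 2.0]
ys = [x ** 3 for x in xs]
rho = spearman_rho(xs, ys)
r = pearson_r(xs, ys)
print(rho, r)
```

On this toy data ρ reaches its maximum while r stays below it, which is the behaviour the paragraph describes.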
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations; τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
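The pair-counting definition translates almost literally into code. The sketch below implements that plain form (often called τ-a); statistical libraries frequently default to a tie-corrected variant, so results can differ when the data contain ties. The function name and toy data are illustrative.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs; tied pairs count as neither."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# One swapped pair out of six: five concordant, one discordant.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```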
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution.
Missing values
Count: a simple visualization of nullity by column.
Matrix: the nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
First rows
| | df_index | open | high | low | adj_close | volume | RSI_14 | STO_14 | CHO | return | Target |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2010-01-22 | 65.60 | 65.96 | 64.91 | 65.59 | 653711 | 26.550868 | 0.000000 | -230526.994114 | -0.004704 | 0.0 |
| 1 | 2010-01-26 | 65.05 | 65.36 | 64.12 | 64.98 | 457095 | 24.549947 | 0.000000 | -153660.903303 | -0.009300 | 0.0 |
| 2 | 2010-01-27 | 64.99 | 65.00 | 63.90 | 64.90 | 444017 | 24.291395 | 0.000000 | 7344.691172 | -0.001231 | 1.0 |
| 3 | 2010-01-28 | 65.06 | 66.49 | 63.98 | 65.20 | 243961 | 27.380053 | 5.882353 | 70378.105506 | 0.004622 | 0.0 |
| 4 | 2010-01-29 | 65.50 | 65.99 | 64.70 | 64.90 | 129331 | 26.227741 | 0.000000 | 61375.649488 | -0.004601 | 1.0 |
| 5 | 2010-02-01 | 65.40 | 66.16 | 64.79 | 66.05 | 172290 | 37.147644 | 22.549020 | 98129.628841 | 0.017720 | 1.0 |
| 6 | 2010-02-02 | 66.95 | 66.95 | 65.99 | 66.70 | 221228 | 42.342590 | 35.294118 | 137973.362470 | 0.009841 | 0.0 |
| 7 | 2010-02-03 | 66.50 | 66.65 | 66.20 | 66.59 | 269839 | 41.714231 | 36.739130 | 204692.472063 | -0.001649 | 0.0 |
| 8 | 2010-02-04 | 66.10 | 66.21 | 63.12 | 63.40 | 286058 | 28.503826 | 0.000000 | 138855.046062 | -0.047905 | 0.0 |
| 9 | 2010-02-05 | 62.31 | 63.49 | 60.76 | 62.00 | 680707 | 24.792907 | 0.000000 | 79464.247554 | -0.022082 | 1.0 |
Last rows
| | df_index | open | high | low | adj_close | volume | RSI_14 | STO_14 | CHO | return | Target |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 2794 | 2021-05-17 | 117.07 | 118.41 | 117.06 | 118.31 | 5279182 | 60.340676 | 100.000000 | 2.077768e+06 | 0.009127 | 0.0 |
| 2795 | 2021-05-18 | 118.09 | 118.88 | 117.74 | 118.19 | 5456773 | 59.835979 | 97.619048 | 2.173268e+06 | -0.001014 | 0.0 |
| 2796 | 2021-05-19 | 117.07 | 118.54 | 116.94 | 118.01 | 4843732 | 59.038297 | 94.047619 | 2.534916e+06 | -0.001523 | 0.0 |
| 2797 | 2021-05-20 | 118.05 | 118.14 | 117.48 | 117.91 | 4833202 | 58.571139 | 92.063492 | 2.918428e+06 | -0.000847 | 1.0 |
| 2798 | 2021-05-21 | 118.01 | 118.13 | 117.10 | 118.01 | 5169068 | 58.921191 | 94.047619 | 4.071479e+06 | 0.000848 | 1.0 |
| 2799 | 2021-05-24 | 118.26 | 119.50 | 117.96 | 119.26 | 5788644 | 63.116431 | 100.000000 | 5.440808e+06 | 0.010592 | 0.0 |
| 2800 | 2021-05-25 | 119.89 | 119.97 | 118.05 | 118.40 | 5697654 | 58.676499 | 78.446115 | 4.354429e+06 | -0.007211 | 1.0 |
| 2801 | 2021-05-26 | 118.94 | 119.58 | 118.79 | 119.24 | 4716894 | 61.523545 | 99.498747 | 3.723120e+06 | 0.007095 | 1.0 |
| 2802 | 2021-05-27 | 119.46 | 119.83 | 118.79 | 119.59 | 6038746 | 62.677378 | 100.000000 | 4.161002e+06 | 0.002935 | 1.0 |
| 2803 | 2021-05-28 | 119.75 | 120.89 | 119.53 | 120.72 | 5991423 | 66.201422 | 100.000000 | 5.391634e+06 | 0.009449 | 1.0 |
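The sample rows are consistent with two relationships that the report itself does not state, so both are treated as assumptions here: `return` looks like the day-over-day fractional change in `adj_close`, and `Target` looks like a flag for whether the next row's `return` is positive. The sketch below checks both against the ten First rows only.

```python
# adj_close, return and Target copied from the "First rows" table.
adj_close = [65.59, 64.98, 64.90, 65.20, 64.90, 66.05, 66.70, 66.59, 63.40, 62.00]
reported_return = [-0.004704, -0.009300, -0.001231, 0.004622, -0.004601,
                   0.017720, 0.009841, -0.001649, -0.047905, -0.022082]
reported_target = [0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0]

# Assumption 1: return[t] = adj_close[t] / adj_close[t - 1] - 1.
# The first row has no predecessor in the sample, so the check starts at row 1.
computed_return = [adj_close[t] / adj_close[t - 1] - 1 for t in range(1, len(adj_close))]
returns_match = all(
    abs(computed - reported) < 1e-6
    for computed, reported in zip(computed_return, reported_return[1:])
)

# Assumption 2: Target[t] = 1.0 if return[t + 1] > 0 else 0.0.
# The last row has no successor in the sample, so it is left out of the check.
computed_target = [1.0 if r > 0 else 0.0 for r in reported_return[1:]]
targets_match = computed_target == reported_target[:-1]

print(returns_match, targets_match)
```

Agreement on a handful of rows does not prove either definition; it only shows that the sample does not contradict them.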